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Abstract 


All  failure  detection  methods  are  based  on  the  use  of  redundancy,  that 
is  on  (possible  dynamic)  relations  among  the  measured  variables.  Conse¬ 
quently  the  robustness  of  the  failure  detection  process  depends  to  a  great 
degree  on  the  reliability  of  the  redundancy  relations  given  the  inevitable 
presence  of  model  uncertainties.  9-^ft*~Ehis  paper ^we^address^fhe  problem  of 
determining  redundancy  relations  whicTT  are  optimally  robust  in  a  sense 
which  includes  the  major  issues  of  importance  in  practical  failure  detection 
and  which  provides  us  with  a  significant  amount  of  intuition  concerning  the 
geometry  of  robust  failure  detection.  — a— i 


hoS  been  proved 

Hi  tHKbI-C  I0.lease  and  sale;  its 

attribution  is  unlimited. 


f  £B  o  1  1984 


This  research  was  supported  in  part  by  The  Office  of  Naval  Research  under 
Grant  N00014-77-C-0224  and  by  NASA  Ames  and  NASA  Langley  Research  Centers 
under  Grant  NGL-22-009-124. 

Presently  with  Schlumberger-Doll  Research,  P.0.  Box,  307,  Ridgefield, 
Conn.  06877. 

* 

Also  affiliated  with  the  M.I.T.  Laboratory  for  Information  and  Decision 
Systems . 


'X-M 


raw 


Also  affiliated  with  the  M.I.T.  Electric  Power  Systems  Engineering 
Laboratory . 


TMij  mjtt.it  hers  approved 
IjX  piL  .*c  v  ai.R  ♦aio;  tlm 

dLstrlbUicj  to  ualLnit^d. 


84  01  23  00  8 


-2- 


1.  Introduction 


In  this  paper  we  consider  the  issue  of  robust  failure  detection.  In 
one  way  or  another  all  failure  detection  methods  generate  signals  which  tend 
to  highlight  the  presence  of  particular  failures  if  they  have  actually 
occurred.  However,  if  any  model  uncertainties  have  effects  on  the  obser¬ 
vables  which  are  at  all  like  those  of  one  or  more  of  the  failure  inodes,  these 
will  also  be  accentuated.  Consequently  the  problem  of  robust  failure  de¬ 
tection  is  concerned  with  generating  signals  which  are  maximally  sensitive 
to  some  effects  (failures)  and  minimally  sensitive  to  others  (model  errors) . 

The  initial  impetus  for  our  approach  to  this  problem  came  from  the 
work  reported  in  [5,  13]  which  document  the  first  and  to  date  by  far  most 
successful  application  and  flight  testing  of  a  failure  detection  algorithm 
based  on  advanced  methods  which  use  analytic  redundancy.  The  singular 
feature  of  that  project  was  that  the  dynamics  of  the  aircraft  were  decomposed 
in  order  to  analyze  the  relative  reliability  of  each  individual  source  of 
potentially  useful  failure  detection  information. 

In  [2]  we  presented  the  results  of  our  initial  attempt  to  extract  the 
essence  of  the  method  used  in  [5,  13]  in  order  to  develop  a  general  approach 
to  robust  failure  detection.  As  discussed  in  that  reference  and  in  others 
(such  as  [3,  7-9]),  all  failure  detection  systems  are  based  on  exploiting 
analytical  redundancy  relat  ons  or  (generalized)  parity  checks »  These  are 
simply  functions  of  the  temj  oral  histories  of  the  measured  quantities  which 
have  the  property  that  they  are  small  (ideally  zero)  when  the  system  is 
operating  normally.  In  (2]  we  present  one  criterion  for  measuring  the  re¬ 
liability  of  a  particular  redundancy  relation  and  use  this  to  pose  an 
optimization  problem  to  det<  rmine  the  most  reliable  relation.  In  [3,  19]  we 
present  another  method  whicl  has  some  computational  advantages  not  found 


'.'A*  V  * 


in  the  approach  described  in  12]* 


In  this  paper  we  describe  the  major  results  of  [2,  3 ,  19].  In  the 
next  section  we  review  the  notion  ol  analytic  redundancy  for  perfectly 
known  models  and  provide  a  geometric  interpretation  which  forms  the  start¬ 
ing  point  for  our  investigation  of  robust  failure  detection.  Section  3 
addresses  the  problem  of  robustness  using  our  geometric  ideas,  and  in  that 
section  we  pose  and  solve  a  first  version  of  the  optimum  robust  redundancy 
problem.  In  Section  4  we  discuss  extensions  to  include  three  important 
issues  not  included  in  Section  3:  scaling,  noise,  and  the  detection/robust 
ness  tradeoff. 
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2.  Redundancy  Relations 


Consider  the  noise- fret  discrete-time  model 


x(k+l)  =  Ax (k)  +  Bi  (k) 
y (k)  =  Cx  (k) 


(2.1) 

(2.2) 


where  x  is  n-dimensional ,  u  is  m-dimensional ,  y  is  r-dimensional ,  and  A,  B, 
and  C  are  perfectly  known.  A  redundancy  relation  for  this  model  is  some 
linear  combination  of  presei  t  and  laqqed  values  of  u  and  y  which  should  be 
identically  zero  if  no  changes  (i.e.  failures)  occur  in  (2.1) ,  (2.2).  As 
discussed  in  [2,  3,  19],  redundancy  relations  can  be  specified  mathemati¬ 
cally  in  the  following  way.  The  subspace  of  (p+1) r-dimensional  vectors 
given  by 

r 

C 

i  i 

(2.3) 


G  A  u)  w* 


CA 


CA 


=  0 


is  called  the  space  of  parii  y  or  redundancy  relations  of  order  g.  The  reason 
for  this  terminology  is  the  followinq.  Suppose  that  u)  6  G.  Then  (2.1)  - 

(2.3)  imply  that  if  we  part  on  <o  into  (p+1)  subvectors  of  dimension  r 


W  *  [W*  ,  .  .  .  ] 

0  p 


(2.4) 


then  at  any  time  k 


i-1 


r(k)  =  LlQ  w’  [y(k-P+i)  -  ^  CA1^"1  Bu(k-P+j)]  =  0  (2.5) 

The  quantity  r(k)  is  called  a  parity  check .  A  simpler  form  for  (2.1) 

(which  we  will  use  later)  c,  n  be  written  in  the  case  when  u  =  0  (or,  equiva¬ 
lently,  if  the  effect  of  th<  inputs  are  subtracted  from  the  observations 
before  computing  the  parity  check) .  In  this  case 
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si 


IS 
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y(k-p) 

r(k)  =  to'  y  (k-p+1) 


(2.6) 


To  continue  our  development,  let  us  as:  ume  that 


w  /  0 
P 


(2.7) 


Let  us  denote  the  components  of  io  as 


"i  *  '“il . "ir1 


(2.8) 


Since  at  least  one  element  of  u)  is  nonz  *ro,  we  can  normalize  u)  so  this 

D 


component  has  unity  value.  In  order  to  Llli strate  several  points#  let  us 


assume  that  the  first  component,  a)  =  1.  in  this  case  (2.5)  can  be  re- 

r  ^ 


written  as 


yl(k)  =  •  ilo  wil  yi(k'P+i)  -  i'o  £S2  “'is  Vs(k-P+i> 


p  i-1  *-*-1 

+  Ll0  L  ca1  3  Bu(k-p+j)]  =  0 


(2;9) 


There  are  two  very  important  interpretations  of  (2.9).  The  most 


obvious  is  that  the  right-hand  side  of  this  equation  represents  a  synthetic 


measurement  which  can  be  directly  compared  to  y^(k)  in  a  simple  comparison 


test.  The  second  interpretation  of  (2.9)  is  as  a  reduced-order  dynamic 


model.  Specifically  this  equation  is  nothing  but  an  autoregressive-moving 


average  (ARMA)  model  for  y^(k).  (From  the  point  of  view  of  the  evolution 


of  y^  according  to  (2.9)#  y2',*,'yr  and  t*le  components  of  u  are  all  regarded 


as  inputs).  This  second  interpretation#  allows  us  to  make  contact  with  the 


numerous  existing  failure  detection  methods.  Typically  such  methods  are 


based  on  a  noisy  version  of  the  model  (2.1),  (2.2)  representing  normal 


system  behavior  together  with  a  set  of  deviations  from  this  model 
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representing  the  several  failure  modes.  Rather  than  applying  such  methods 
to  a  single,  ail-encompassimj  model  as  in  (2.1),  (2.2),  one  could  alterna¬ 
tively  apply  the  same  techniques  to  individual  models  as  in  (2.9)  (or  a 
combination  of  several  of  these) ,  thereby  isolating  individual  (or  specific 
groups  of)  parity  relations.  For  example,  this  is  precisely  what  was  done 
in  [5,  13].  The  advantage  of  such  an  approach  is  that  it  allows  one  to 
separate  the  information  provided  by  redundancy  relations  of  differing 
levels  of  reliability,  something  that  is  not  easily  done  when  one  starts 


with  the  overall  model  (2.1),  (2.2)  which  combines  all  redundancy  relations 


In  the  next  two  section  ;  we  address  the  main  problem  of  this  paper, 
which  is  the  determination  o'  optimally  robust  redundancy  relations.  The 
key  to  this  approach  is  the  observation  that  G  in  (2.3)  is  the  orthogonal 
complement  of  the  range  Z  of  the  matrix 

fc  1 


(2.10: 


Thus  (assuming  u  =  0  or  that  :he  effect  of  u  is  subtracted  from  the  obser¬ 
vations)  a  complete  set  of  i  idependent  parity  relations  of  order  p  is  given 
by  the  orthogonal  projection  of  the  window  of  observations  y(k) , 


y (k— 1 ) , . . . ,y (k-p)  onto  G. 


i 


•V 


vw 


3.  An  Angular  Measure  of  Robustness 


Consider  a  model  containing  imperfcctl  '  known  parameters  T),  process 


noise  w  and  measurement  noise  v: 


x(k+l)  =  A (H) x (k)  +  B(H)u(k)  +  w(k • 


(3.1) 


y(k)  =  C(n)x(k)  +  v(k) 


(3-2) 


where  H  is  a  vector  of  unknown  parameters  a  id  where  the  matrices  A,  B,  C 


and  the  covariances  of  w  and  v  are  function;  of  r).  Let  K  denote  the  set 


of  possible  values  which  n  can  take  on.  In  their  work  (21  Chow  and  Willsky 


used  the  following  line  of  reasoning.  If  t  le  parameters  of  the  system  were 


known  perfectly  and  if  there  were  no  proces  ;  or  measurement  noises ,  then 


according  to  (2.5)  we  could  find  a  vector  u, •  =  [u>*  f . . .  foj*  ]  and  a  vector 

0  p 


y  ~  ILq t  * • • with 


u!  -  .  £  w! 

1  j=l+l  1 


(3.3) 


so  that 


=  £  w!y (k-p+i)  -  .£  U!  u(k-p+l)  ■  0 

x=0  l  1  1=0  l 


(3.4) 


In  the  uncertain  case,  what  would  seem  to  mike  sense  is  to  minimize  some 


measure  of  the  size  of  r(k).  For  example  one  could  consider  choosing  w  and 


U  that  solve  the  minimax  problem 


lr(k)  1‘ 


(3.5) 


x0(n) 


w  =  i 


Here  the  expectation  is  taken  for  each  valun  of  ri  and  assuming  that  the 


system  is  at  particular  operating  joint,  i.*s.  that  u(k)  =  u  and  that  x  (n) 


is  the  corresponding  set  point  value  of  the  state.  This  criterion  has  the 


w 


v\- 


interpretation  of  finding  the  approximate  parity  relation  which ,  at  the 
specified  operating  point,  produces  the  residual  with  the  smallest  worst- 
case  mean-square  value  when  no  failure  has  occurred. 


Let  us  make  several  comments  concerning  the  procedure  just  described. 

In  the  first  place  the  optimization  problem  (19)  is  a  complex  nonlinear 

programming  problem.  Furthermore,  the  method  does  not  easily  give  a  sequence 

of  parity  relations  ordered  by  their  robustness.  Finally  the  optimum  parity 

relation  clearly  depends  upon  the  operating  point  as  specified  by  u  and 

o 

Xo<n>.  In  some  problems  this  may  be  desireable  as  it  does  allow  one  to 
adapt  the  failure  detection  algorithm  to  changing  conditions,  but  in  others 
it  might  be  acceptable  or  preferable  to  have  a  single  set  of  parity  rela¬ 
tions  for  all  operating  conditions.  The  approach  developed  in  this  paper 
produces  such  a  set  and  results  in  a  far  simpler  computational  procedure. 

To  begin,  let  us  focus  cn  (3.1),  (3.2)  with  u  =  w  =  v  =  0.  Referring 
to  the  previous  discussion,  ve  note  that  it  is  in  general  impossible  to 
find  parity  checks  which  are  perfect  for  all  possible  values  of  n.  That  is, 
in  general  we  cannot  find  a  subsnace  G  which  is  orthogonal  to 

Z (n)  =  Range 

for  all  n. 

What  would  seem  to  make  iense  in  this  case  is  to  choose  a  subspace  G 
which  is  "as  orthogonal  as  possible"  to  all  possible  Z(p).  Several  possible 
ways  in  which  this  can  be  done  are  described  in  detail  in  [3].  In  tl  is 
paper  we  focus  on  the  one  approach  which  leads  to  the  most  complete  picture 
of  robust  redundancy  and  which  is  computationally  the  simplest.  To  do  this, 
however,  we  must  make  the  assumption  hat  K,  the  set  of  possible  values  of 


C(H) 

c  (n)  a  (t.) 
,c(n)A(r)p 


ri  is  finite.  Typically  what  this  would  involve  is  choosing  representative 
points  out  of  the  actual,  continuous  range  of  parameter  values.  Here  "repre¬ 
sentative"  means  spanning  the  range  of  possible  values  and  having  density 
variations  reflecting  any  desired  weightings  on  the  likelihood  or  importance 
of  particular  sets  of  parameter  values.  However  this  is  accomplished,  we 
will  assume  for  the  remainder  of  this  paper  that  t|  takes  on  a  discrete  set 
of  values  H=1,...,L,  and  will  use  the  notation  A^  for  A(H=i) ,  Z^  for  Z(H=i), 
etc. 


To  obtain  a  simple  computational  procedure  for  determining  robust  re¬ 
dundancy  relations  we  first  compute  an  average  observation  subspace  Zq  which 
is  as  close  as  possible  to  all  of  the  ,  and  we  then  choose  G  to  be  the 
orthogonal  complement  of  Z^.  To  be  more  precise,  note  first  that  the  are 
subspaces  of  possibly  differing  dimensions  (dim  Z^  =  \  )  embedded  in  a  space 

of  dimension  N  =  (p+l)r.  We  will  find  it  convenient  to  use  the  same  symbols 

Z_,...,Z  to  denote  matrices  of  sizes  NxV. ,  i=l,...,L,  whose  columns  forn 
1  L  a 

orthonormal  bases  for  the  corresponding  subspaces.  Letting  M  = 
we  define  the  NxM  matrix 


Z  = 


(3.7) 


Thus  the  columns  of  Z  span  the  possible  directions  in  which  observation 
histories  may  lie  under  normal  conditions. 

We  now  suppose  that  we  wish  to  determine  the  s  best  parity  checks  (so 
that  dim  G=s) .  Thus  we  wish  to  determine  a  subspace  Zq  of  dimension  N-s. 
The  optimum  choice  for  this  subspace  is  taken  to  be  the  span  of  the  (not 
necessarily  orthogonal)  columns  of  the  matrix  Zq  which  minimizes 

Hz  -  zo||F  (3*8 

subject  to  the  constraint  that  rank  r  r  N-s.  Here  II  •  II  p  denotes  the 


Frobenius  norm: 
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!{ d|| ?,  =  ?  ?  Id. . 

1  nF  j  l  ij 


(3.9) 


There  are  several  impoi  tant  reasons  for  choosinct  this  criterion,  one 
being  that  it  does  produce  a  space  which  is  as  close  as  possible  to  a 
specified  set  of  directions.  A  second  is  that  the  resulting  optimization 
problem  is  easy  to  solve.  in  particular,  let  he  singular  value  decomposi¬ 
tion  of  Z  [14,  15]  be  given  by 

Z  =  U  I  V  (3.H 

where  U  and  V  are  orthogonal  matrices ,  and 


0  0  . 

I  =  \  *  0 

0  ’a  * 

n 


Here  <  9  •  *  <  are  the  singular  values  of  Z  ordered  by  magnitude. 

Note  we  have  assumed  N  <  M.  If  this  is  not  the  case  we  can  make  it  so 
without  changing  the  optimur  choice  of  Zq  by  padding  Z  with  additional 
columns  of  zeros.  It  is  ref dily  shown  [17,  18]  that  the  matrix  Zq  minimiz 


ing  (3.8)  is  given  by 


z  =  u 
o 


9  V 


Moreover,  since  the  columns  of  U  are  orthonormal,  we  immediately  see  that 
the  orthogonal  complement  of  the  ranje  of  Zq  is  given  by  the  first  s  left 
singular  vectors  of  Z^,  i  e.  the  first  s  columns  of  U.  Consequently 

G  *  [u.  • «  •  u  j  (3.3 

1 .  .  s 

and  are  the  optimum  redundancy  relations. 

1  s 

There  is  an  alternative  interpretation  of  this  choice  of  G  which 


provides  some  very  useful  insight.  pacifically,  recall  that  what  we  wish  to 

do  is  to  find  a  G  whose  columns  are  is  orthogonal  as  possbile  to  the  columns 


of  the  Z.;  that  is,  vc  would  like  to  choose  G  to  make  each  of  the  matrices 
1 

Z^G  as  close  to  zero  as  possible.  I  i  fact,  as  shown  in  [3],  the  choice  of 
G  given  in  (3.13)  minimizes 


j(s)  =  ||z;g||f 


yielding  the  minimum  value 

s  2 

J(s)  =  T.  o. 

i=l  i 


(3.14) 


(3.15) 


There  are  two  important  points  to  observe  about  the  result  (3.14), 
(3.15).  The  first  is  that  we  can  now  see  a  straiqhtforward  way  in  which  to 
include  unequal  weightings  on  each  of  the  terms  in  (3.14).  Specifically, 
if  tie  w^  are  positive  numbers,  then 


X  w-  llz’.Gllf;  =  X  II  *w.  Z :g||£ 

1  =  1  1  11  1  F  1=1  "  X  1  mF 


(3.16) 


so  that  minimizing  this  quantity  is  accomplished  using  the  same  procedure 
described  previously  but  with  Z^  replaced  by  iw.  Z^.  As  a  second  point 
note  that  the  optimum  value  (3.17)  provides  us  with  an  interpretation  of 
the  singular  values  as  measures  of  robustness  and  with  an  ordered  sequence 
of  parity  relations  from  most  to  lcv.st  robust. 
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4.  Several  Important  Extensions 

In  this  section  we  address  several  of  the  drawbacks  and  limitations  of 
the  result  of  the  preceding  section  and  obtain  modifications  to  this  result 
which  overcome  them  at  no  fundamental  increase  in  complexity. 


4.1  Scaling 

A  critical  problem  with  the  method  us<*d  in  the  preceding  section  is  that 
all  vectors  in  the  observation  spaces  are  treated  as  being  equally  likely 
to  occur.  If  there  are  differences  in  scale  among  the  system  variables  this 
may  lead  to  poor  solutions  for  the  optimum  parity  relations.  To  overcome 
this  drawback  we  proceed  as  follows.  Suppose  that  we  are  given  a  scaling 
matrix  P  so  that  with  the  change  •  >f  basis 

f.  =  Px  (4.1) 


one  obtains  a  variable  F,  which  is  equally  likely  to  lie  in  any  direction. 

For  example  if  covariance  ana  ysis  has  been  performed  on  x  and  its  covariance 
is  Q,  then  P  can  be  chosen  to  satisfy 


Q  =  P~1(P')”1 


(4.1) 


and  the  resulting  covariance  of  <  is  the  identity. 

As  a  next  step,  recall  that  vhat  we  would  ideally  like  to  do  is  to  choose 
a  matrix  G  so  that 


C. 

1 

V'1  1 

C .  A . 

l  l 

n 

*0 

i 

G* 

x  =  G* 

_L  X 

C.A.p 

1  i_ 

c  nVl 

Li  J 

F,  *  G’c.r 

X 


(4.3) 


is  as  small  as  possible.  In  the  preceding  section  we  considered  all  directions 
in  Z.  *  Range  (C.)  to  be  on  ecual  footinq  and  arrived  at  the  criterion  (4.4) 

i  i 
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Since  all  directions  for  f,  are  on  equal  fo»  tin \,  we  are  led  naturally  to  the 
following  criterion  which  takes  scaling  into  a <  count 

L  2 

j  (s)  =  i;  1 1  c  'g||  (4.4) 

i=l  1  F 

Using  the  result  [17]  cited  in  the  pr  »vio  is  section  we  see  that  to 
find  the  Nxs  matrix  G  (with  orthonormai  column  5)  which  minimizes  J  (s)  we 
must  perform  a  singular  value  decompos itio  1  of  the  matrix 

c  =  [c.  :  c. . .  :cT  ]  =  u  >.:  v  (4.5) 

1  •  2  .  .  Ij 

?  2  2 

where  oT  <  <  .  ..<  ^  and  U  =  [u/.u  *....*  u  I.  Then  u_  is  the  best  parity 

12  N  1.  2,  .  N  1 
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relation  with  u  as  its  measure  of  robustness,  is  the  next  best,  etc., 
and  J*  (s)  is  given  by  (3.15).  Finally,  in  ant  ieipation  of  the  next  subsection, 
suppose  that  we  use  the  stochastic  interpretation  of  i9  i.e.  that 


eicf.1!  =  i  (4.6) 

In  this  case  if  we  define  the  parity  check  vector 

u  =  G 1 C , C  (4.7) 

i  l 

then 

Elll^ll2]  -  llejcllp  <«.M 

4.2  Observation  and  Process  Noise 

In  addition  to  choosing  parity  relations  which  are  maximally  insensitive 
to  model  uncertainties  it  is  also  important  t(  choose  relations  which  suppress 
noise.  Consider  then  the  model 


5 

s 


x  (k+1 )  =  A.x(k)  +  D.wik) 
y  (k)  *  C^x  (k)  +  v  (k) 

where  w  and  v  are  independent,  zero-mean  white  noise  processes 


i 


-  /-A  A  ■■ 


(4.") 

(4.10) 

with  covariances 


Q  and  R,  respectively. 


U  =  G*  : 


y  (k+p) 


(4.10) 


Then  using  the  interpretation  provided  in  (4.7)  ,  we  obtain  the  following 
natural  generalization  of  the  :riterion  (4.4): 


J(s)  =  Z  E  [  1 1 P  !  |  1 
i=l  1 


(4.11) 


where  denotes  expectation  assuming  that  the  ith  model  is  correct.  Assuming 
that  £(k)  =  Px(k)  has  the  identity  as  its  covariance,  using  the  whiteness  of 
w  and  v,  and  performing  some  aLgebraic  manipulations  we  obtain  (3] 


J(S)  =  T,  llc'Glip  +  f!  S  clip 
i=l 


where  S  is  defined  by  the  following: 


C .  A  ,  D  .  C.b 

x  i  l  l  i 


C.A?-1D.  C.AP_2n. 
11  1  11  1 


Q  =  diag  (Q,...,0)  (p  times) 


(4.12) 


(4.13) 


R  =  diag  (R,...,R)  ((p+1)  times) 


(4.14) 


n  =  T.  5.56:  t  ijR  =  ss1 

i=l  1  1 


(4.15) 


From  (4.12)  we  see  that  the  effect  of  the  noise  is  to  specify  another 


set  of  directions,  namely  the  columns  of  S,  to  which  we  would  like  to  make 
the  columns  of  G  as  close  to  orthogonal  as  possible.  From  this  it  is  evident 


that  the  optimum  choice  of  G  is  computed  by  performing  a  singular  value 
decomposition  on  the  matrix 

[Ci;...:CL:sl  =  U  Z  V  (4.16) 

As  before  (4.16)  provides  a  complete  set  of  parity  relations  ordered  in  terms 
of  their  degrees  of  insensitivity  to  model  errors  and  noise. 

4.3  Detection  Versus  Robustness 

The  methods  described  to  this  point  involve  measuring  the  quality  of 
redundancy  relation  ;  in  terms  of  how  small  the  resulting  parity  checks  are 
undei  normal  operating  conditions.  However,  in  some  cases  one  might  prefer 
to  use  an  alternative  viewpoint.  In  particular  there  may  be  parity  checks 
which  are  not  optimally  robust  in  the  senses  we  have  discussed  but  are  still 
of  significant  value  because  they  are  extremely  sensitive  to  particular 
failure  modes.  In  this  subsection  we  consider  a  criterion  which  takes 
such  a  possibility  into  account.  For  simplicity  we  focus  on  the  noise-free 
case.  The  extension  to  include  noise  as  in  the  previous  subsection  is 
stra i ght  forward . 

The  specific  problem  we  consider  is  the  choice  of  parity  checks  for  the 
robust  detection  of  a  particular  failure  mode.  We  assume  that  the  unfailed 
model  of  the  system  is 

x  (k+1)  «  Au(n)x(k)  (4.17) 

y  (k)  *  C  (n)  x  (k)  (4.18) 

u 

while  if  the  failure  has  occurred  the  model  is 

x(k+l)  =Af(n)x(k)  (4.19) 

y(k)  *  Cf (H)  x(k)  (4.20) 

In  this  case  one  would  like  to  choose  G  to  bt  Mas  orthogonal  as  possible"  to 


L6- 


Zu(n)  and  "as  parallel  as  possible"  to  Z^(n). 

Assume  again  that  n  takes  on  one  of  a  finite  set  of  possible  values,  and 

let  C  .  and  C„.  denote  the  counterparts  of  C,  in  (4.3)  for  the  unfailed  and 
ui  fi  i 

failed  models,  respectively.  A  natural  criterion  which  reflects  our  objective 
is 

L  9  -> 

J(s)  =  min  ;■  {||c; .clip  -  ||c*  G||p  (4.21) 

G'G=I  i=l  U1  F  Fl  F 

If  we  define  the  matrix 


h  =  [c  .:c  :c.0:..:c_T  j 

ul  u2  uL  fl  f  2  FL 


columns 


columns 


J(s)  =  min  tr{G'HSH'G) 
G'G=I 


(4.22) 


(4.23) 


where 


(4.24) 

It  is  straightforward  (see  (3))  to  show  that  a  minor  modification  of  the 
result  in  [17]  leads  to  the  following  solution.  We  perfo -m  an  eigenvector- 
eigenvalue  analysis  on  the  matrix 


HSH’  =  U  A  U' 


w!  ere  U’U  *  I  and 

A  =  diag  (A  , . . . ,  XN> 

with  X  <  X  <  . . .  <  X  and  U  =  [u  .’ 
1  -  2  *  “  N  1* 

is 


[U 


i: 


^U  ] 


VNi 


(4.25) 


(4.26) 


7) 


*.u  ] .  Then  the  optimum  choice  of  G 

•  N 


and  the  corresponding  value  of  (4.23)  is 


J*(s)  =  Y  X. 


(4.28) 


Let  us  make  two  comments  about  this  solution.  The  first  is  that  as  many 
as  of  the  X^  can  be  negative.  In  fact  tie  parity  check  based  on  is 
likely  to  have  larger  values  under  failed  rather  than  unfailed  conditions 
if  and  only  if  X^  <  0.  Thus  we  immediately  see  that  the  maximum  nunber  of 
useful  parity  relations  for  detecting  this  particular  failure  mode  equals 
the  number  of  negative  eigenvalues  of  HSH* .  As  a  second  comment,  let  us 
contrast  the  procedure  we  use  here  with  a  singular  value  deconposition,  which 
corresponds  essentially  to  performing  an  eigenvector-eigenvalue  analysis  of 
HH'.  First,  assume  that  the  first  K  of  the  \  are  negative.  Then,  define 

A  '  "V  A  •  -x2 . °k  *  -\r 


°K+1  =  XK+1"**'0N  *  \» 


(4.29) 


Prom  (4.25)  we  have  that 


HSH’  =  UESEU’ 


where 


£  =  diagfo^, . . . ,0^) 


(4.31) 


Assuming  that  E  is  nonsingular ,  define 


V  *  E  U'H 


(4.32) 


Then  (4.31) ,  (4.32)  imply  that  V  is  S-orthogonal 


VSV*  =  S 


(4.33) 


and  that  H  has  what  we  call  as  S-singular  value  decomposition 


H  *  UEV 


(4.34) 
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